Prototype-based minimum classification error/generalized probabilistic descent training for various speech units

نویسندگان

  • Erik McDermott
  • Shigeru Katagiri
چکیده

In previous work we reported high classiication rates for Learning Vector Quantization (LVQ) networks trained to classify phoneme tokens shifted in time. It has since been shown that the framework of Minimum Classiication Error (MCE) and Generalized Probabilistic Descent (GPD) can treat LVQ as a special case of a general method for gradient descent on a rigorously deened classiication loss measure that closely reeects the misclassiication rate. This framework allows us to extend LVQ into a prototype-based minimum error classiier (PBMEC) appropriate for the classiication of various speech units which the original LVQ was unable to treat. Speech categories are represented using a prototype-based multi-state architecture incorporating a Dynamic Time Warping procedure. We present results for the diicult E-set task, as well as for isolated word recognition for a vocabulary of 5240 words, that reveal clear gains in performance as a result of using PBMEC. In addition, we discuss the issue of smoothing the loss function from the perspective of increasing classiier robustness.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Minimum Classification Error Training of Hidden Conditional Random Fields for Speech and Speaker Recognition

Hidden conditional random fields (HCRFs) are derived from the theory of conditional random fields with hidden-state probabilistic framework. It directly models the conditional probability of a label sequence given observations. Compared to hidden Markov models, HCRFs provide a number of benefits in the acoustic modeling of speech signals. Prior works for training on HCRFs were accomplished with...

متن کامل

Discriminative Training of GMM for Language Identificatio..

In this paper, a discriminative training procedure for a Gaussian Mixture Model (GMM) language identification system is described. The proposal is based on the Generalized Probabilistic Descent (GPD) algorithm and Minimum Classification Error Rates formulated to estimate the GMM parameters. The evaluation is conducted using the OGI multi-language telephone speech corpus. The experimental result...

متن کامل

Discriminative training of naive Bayes classifiers for natural language call routing

In this paper, we propose to use a discriminative training(DT) method to improve naive Bayes classifiers in context of natural language call routing. As opposed to the traditional maximum likelihood estimation, all conditional probabilties in Naive Bayes classifers (NBC) are estimated discriminatively based on the minimum classification error (MCE) criterion. A smoothed classification error rat...

متن کامل

Simultaneous Design of Feature Extractor and Pattern Classifer Using the Minimum Classification Error Training Algorithm

~ Recently, a minimum classification error training algorithm has been proposed for minimizing the misclassification probability based on a given set of training samples using a generalized probabilistic descent method. This algorithm is a type of discriminative learning algorithm, but it approaches the objective of minimum classification error in a more direct manner than the conventional disc...

متن کامل

Simultaneous Design of Feature Extractor and Pattern Classifier Using the Minimum Classification Error Training Algorithm

~ Recently, a minimum classification error training algorithm has been proposed for minimizing the misclassification probability based on a given set of training samples using a generalized probabilistic descent method. This algorithm is a type of discriminative learning algorithm. but it approaches the objective of nlininlumclassification error in a more direct manner than the conventional dis...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Computer Speech & Language

دوره 8  شماره 

صفحات  -

تاریخ انتشار 1994